Basic Statistics

Raw Counts

Name                    Value
Rows                    47,552
Columns                 29
Discrete columns        17
Continuous columns      12
All missing columns     0
Missing observations    106,556
Complete Rows           1,360
Total observations      1,379,008
Memory allocation       32.2 Mb
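
The Percentages section that follows is derived from these raw counts. As a minimal sketch (plain Python, not the code of the tool that generated this report; all variable and function names are illustrative), the ratios can be recomputed directly from the table:

```python
# Raw counts copied from the table above.
rows = 47_552
columns = 29
discrete_columns = 17
continuous_columns = 12
missing_observations = 106_556
complete_rows = 1_360

# Total observations is rows x columns.
total_observations = rows * columns


def pct(part, whole):
    """Return part/whole expressed as a percentage."""
    return 100.0 * part / whole


pct_discrete = pct(discrete_columns, columns)
pct_continuous = pct(continuous_columns, columns)
pct_missing = pct(missing_observations, total_observations)
pct_complete = pct(complete_rows, rows)

print(f"Total observations:   {total_observations:,}")
print(f"Discrete columns:     {pct_discrete:.2f}%")
print(f"Continuous columns:   {pct_continuous:.2f}%")
print(f"Missing observations: {pct_missing:.2f}%")
print(f"Complete rows:        {pct_complete:.2f}%")
```

The product of rows and columns reproduces the reported 1,379,008 total observations. Roughly 7.7% of all observations are missing, yet only about 2.9% of rows are complete, so the missing values are spread across most rows rather than concentrated in a few.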

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 13 columns ignored with more than 50 categories.
## EVENT_ID_CNTY: 47552 categories
## EVENT_DATE: 4330 categories
## ACTOR1: 1579 categories
## ASSOC_ACTOR_1: 7307 categories
## ACTOR2: 527 categories
## ASSOC_ACTOR_2: 2599 categories
## COUNTRY: 184 categories
## ADMIN1: 1889 categories
## ADMIN2: 7643 categories
## ADMIN3: 4643 categories
## LOCATION: 13977 categories
## SOURCE: 10524 categories
## NOTES: 47156 categories
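
The list above reflects a cardinality filter: discrete columns with more than 50 distinct values are left out of the bar charts. The following is a minimal sketch of that idea in plain Python, not the reporting tool's actual implementation. The function name, the toy column names, and the choice to exclude missing values from the count are all assumptions made for illustration.

```python
def split_by_cardinality(records, max_categories=50):
    """Split columns into those kept for plotting and those ignored.

    records: list of dicts mapping column name -> value (None = missing).
    Returns (kept, ignored); each maps column name -> number of distinct
    non-missing values. Missing values are not counted as a category here,
    which is an assumption of this sketch.
    """
    distinct = {}
    for row in records:
        for name, value in row.items():
            values = distinct.setdefault(name, set())
            if value is not None:
                values.add(value)

    kept, ignored = {}, {}
    for name, values in distinct.items():
        target = ignored if len(values) > max_categories else kept
        target[name] = len(values)
    return kept, ignored


# Toy data (hypothetical): an ID-like column and a low-cardinality column.
toy = [{"EVENT_ID": f"E{i}", "REGION": f"R{i % 3}"} for i in range(100)]
kept, ignored = split_by_cardinality(toy, max_categories=50)

print(f"{len(ignored)} columns ignored with more than 50 categories.")
for name, n in ignored.items():
    print(f"{name}: {n} categories")
```

Identifier-like and free-text columns such as EVENT_ID_CNTY and NOTES have nearly one distinct value per row, so a frequency bar chart of them would carry no information; a threshold of this kind removes them automatically.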

QQ Plot

Correlation Analysis

## 13 features with more than 20 categories ignored!
## EVENT_ID_CNTY: 1360 categories
## EVENT_DATE: 940 categories
## ACTOR1: 222 categories
## ASSOC_ACTOR_1: 469 categories
## ACTOR2: 110 categories
## ASSOC_ACTOR_2: 412 categories
## COUNTRY: 49 categories
## ADMIN1: 278 categories
## ADMIN2: 773 categories
## ADMIN3: 980 categories
## LOCATION: 1094 categories
## SOURCE: 482 categories
## NOTES: 1323 categories
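
The category counts in this section are lower than those in the bar chart section. EVENT_ID_CNTY, for instance, drops from 47552 to 1360, which equals the Complete Rows count in the raw counts table. That is consistent with this step having been run on complete rows only, although the report does not state this explicitly. A minimal sketch of the effect, in plain Python with hypothetical toy data and helper names:

```python
def complete_rows(records):
    """Keep only rows in which no value is missing (None)."""
    return [row for row in records if all(v is not None for v in row.values())]


def count_categories(records, column):
    """Count distinct non-missing values of one column."""
    return len({row[column] for row in records if row[column] is not None})


# Toy data (hypothetical): two of the four rows contain a missing value.
toy = [
    {"id": "A1", "actor": "x", "fatalities": 0},
    {"id": "A2", "actor": None, "fatalities": 3},
    {"id": "A3", "actor": "y", "fatalities": None},
    {"id": "A4", "actor": "x", "fatalities": 1},
]

complete = complete_rows(toy)

print("id categories, all rows:     ", count_categories(toy, "id"))
print("id categories, complete rows:", count_categories(complete, "id"))
```

Because a unique identifier has one category per row, its category count after filtering equals the number of rows that survived, which is why it is a convenient check on how many rows an analysis step actually used.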

Principal Component Analysis

## 12 features with more than 50 categories ignored!
## EVENT_ID_CNTY: 1360 categories
## EVENT_DATE: 940 categories
## ACTOR1: 222 categories
## ASSOC_ACTOR_1: 469 categories
## ACTOR2: 110 categories
## ASSOC_ACTOR_2: 412 categories
## ADMIN1: 278 categories
## ADMIN2: 773 categories
## ADMIN3: 980 categories
## LOCATION: 1094 categories
## SOURCE: 482 categories
## NOTES: 1323 categories